CDS

Accession Number TCMCG064C14986
gbkey CDS
Protein Id XP_011081405.1
Location complement(join(11503669..11503833,11504193..11504315,11504381..11504495,11506115..11506213,11506299..11506351,11508356..11508512,11509251..11509309,11509426..11509493,11509577..11509720,11510860..11510923,11512277..11512327,11512410..11512549,11512626..11512765,11513654..11513739,11513812..11513957,11514061..11514133,11514950..11515025,11515681..11515788,11516059..11516105,11517491..11517536,11518300..11518412,11518497..11518532,11518610..11518813,11519933..11519998))
Gene LOC105164452
GeneID 105164452
Organism Sesamum indicum

Protein

Length 792aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_011083103.2
Definition DNA mismatch repair protein MSH4 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category L
Description MutS family domain IV
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K08740        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAAGACGATACAGGAGAGAAATCCAGCTTCATCGTTGGTATCATTGAGAACCGCGCCAAAGAGGTTGGAGTGGCTGCATTTGACTTGAGGTCAGCTTCATTTCACCTTTCTCAGTATATTGAGACGAGCAGCTCATACCAGAACACAAAAACACTACTGAATTTTTATGATCCTATGGTGATTATTGTTACACCTACTAAGCTGGCTCCTGATGGCATGGTTGGAGTTTCTGAGCTGGTTGACAGATTTTTCTCTTCAACCACAAAGGTTATGATGGCTCGCAGTTGCTTTGATGACACCAGGGGTGGTGCTCTAGTTAAAAACTTGGCAGCCAAGGAATCATCTGCTCTTGGTTTGGATACTTATTACAAGCAATATTATCTATGCTTGGCTGCAGCTGCTGCAACCATCAAGTGGATAGAAGCAGAGAAGGGAGTCATCATCACTAACCACTCATTATCTGTTACTTTTAATGGATCATTCGACCACATGAATGTAGACGCCACTAGTGTCCAGAACTTGGAAATCATTGAGCCGATGCACTCTAGTCTTTGGGGCACTAACAACAAGAAGAGAAGTCTATTTCATATGCTCAAGACAACAAGGACCATTGGAGGGACTAGACTATTGCGAGCCAATCTTTTGCAGCCTCTCAAAGACATTGACACTATCAATGCCCGACTTGATTGCCTGGATGAGTTGATGAGCAATGAGCAGCTGTTCTTTGGCTTGTCCCAGGCTCTTCGTATGTTTCCAAAGGAAACTGATAAAGTCCTTTGTCATTTTTGCTTTAAGCAAAAGAAAGTTACTAATGGGGTCCTGGCCATAGACAACTCCAGAAAGAGCCAAATATTGATCTCAAGCATTATTCTTCTTAAAACGGCTCTAGATGCATTACCATTACTCTCCAAGGCGCTTAAGGATGCCAACTGTTTTCTACTCAGAAATGTTTACAAGTCTATATGTGCAAATGAAAAATTTTCTTCCATGAGGGAAAGGATTGGAGAGGTTATTGATGAAGATGTTCTTCATGCACGTGTCCCTTTTGTTGCTCGAACTCAGCAGTGTTTTGCTGTCAAGGCAGGAATTGATGGGCTACTAGATATTGCACGCAGATCTTTCTGTGACACCAGTGAAGCGATACATAACTTGGCCAACAAGTACCGAGAGGATTTCAAGCTGCCGAATTTGAAAATCCCATACAACAAAAGGCAAGGTTTCTACTTTAGCATACCTCAGAAGGATATACAGGGAAAACTTCCCAGCAAGTTCATCCAGGTCATAAAACATGGAAACAACATACATTGCTCTTCTCTGGAACTGGCCTCTCTGAATGTAAGGAACAAGTCTGCAGCTAAAGAGTGCTATATCCGGACTGAATTTTGCCTGGAAGCCTTAATGGATGCCATAAGGGAGGATGTCTCTGTGCTCACTCTTCTAGCGGAGGTGTTATGCCTTCTGGATATGATAGTTAATTCATTTGCTCATACAATATCCATGAAGCCAGTTGACAAATATACTAGACCTCAGTTTACGTATGATGGTCCATTGGCAATTGATTCAGGAAGGCACCCAATTCTTGAAAGTGTCCACAATGAGTTTGTTGCGAACAACATTTTTCTGTCTGAAGCATCAAATATGATAATTGTTACAGGCCCAAACATGAGTGGAAAGAGTACTTATCTTCAACAAGTTTGCTTGGTAGTCATCCTTGCTCAAATTGGTTGTTACGTTCCCGCCCGCTTTGCAACATTAAGAGTAGTTGATCGCATATTTACTAGGATGGGAACTATGGACAGTGTTGAATCAAATTCCAGCACGTTTATGACAGAGATGAAGGAGACGGCGTTTATCTTGCAAAATGCTTCTCATAGGAGTCTCATTGTTGTGGATGAATTAGGAAGAGCAACATCTTCCTCTGATGGGTTTGCAATTGCATGGAGTTGCTGCGAGCATCTGCTCGCTTTAAGAGCGTACACTATCTTTGCTTCTCATATGGAAAACCTATCTGAATTGGCCACCATTTATCCAAATGTCAAAATTGTTCACTTCGACGTTGAGATTAAGAACAAACGGATGGATTTCAAGTTTCGACTGAAAGATGGGCCGAGGCATGTAGCACACTATGGCCTCATGCTAGCAGGAGTAGCTGGACTGCCAAGTTCTGTGATAGAGTCTGCTCAAAGCATCACATCAAAGATTACCGAAAAGGAAGTGAAGAGAATAGAGCTTAATTTCCAGCAGCATCACGATATTCAAATGGCTTACCGTGTTGCTCAAAGACTCATATGCCTCAAATATTCCAACCAAGATGAGGACTCAATCCGGCAAGCACTGCAGAATCTTAAAGAAAGTTATATTAACGGCGGGCTTTAA
Protein:  
MEDDTGEKSSFIVGIIENRAKEVGVAAFDLRSASFHLSQYIETSSSYQNTKTLLNFYDPMVIIVTPTKLAPDGMVGVSELVDRFFSSTTKVMMARSCFDDTRGGALVKNLAAKESSALGLDTYYKQYYLCLAAAAATIKWIEAEKGVIITNHSLSVTFNGSFDHMNVDATSVQNLEIIEPMHSSLWGTNNKKRSLFHMLKTTRTIGGTRLLRANLLQPLKDIDTINARLDCLDELMSNEQLFFGLSQALRMFPKETDKVLCHFCFKQKKVTNGVLAIDNSRKSQILISSIILLKTALDALPLLSKALKDANCFLLRNVYKSICANEKFSSMRERIGEVIDEDVLHARVPFVARTQQCFAVKAGIDGLLDIARRSFCDTSEAIHNLANKYREDFKLPNLKIPYNKRQGFYFSIPQKDIQGKLPSKFIQVIKHGNNIHCSSLELASLNVRNKSAAKECYIRTEFCLEALMDAIREDVSVLTLLAEVLCLLDMIVNSFAHTISMKPVDKYTRPQFTYDGPLAIDSGRHPILESVHNEFVANNIFLSEASNMIIVTGPNMSGKSTYLQQVCLVVILAQIGCYVPARFATLRVVDRIFTRMGTMDSVESNSSTFMTEMKETAFILQNASHRSLIVVDELGRATSSSDGFAIAWSCCEHLLALRAYTIFASHMENLSELATIYPNVKIVHFDVEIKNKRMDFKFRLKDGPRHVAHYGLMLAGVAGLPSSVIESAQSITSKITEKEVKRIELNFQQHHDIQMAYRVAQRLICLKYSNQDEDSIRQALQNLKESYINGGL